Discovery and recognition of formula concepts using machine learning

نویسندگان

چکیده

Abstract Citation-based Information Retrieval (IR) methods for scientific documents have proven effective IR applications, such as Plagiarism Detection or Literature Recommender Systems in academic disciplines that use many references. In science, technology, engineering, and mathematics, researchers often employ mathematical concepts through formula notation to refer prior knowledge. Our long-term goal is generalize citation-based apply this generalized method both classical references concepts. paper, we suggest how formulas could be cited define a Formula Concept task with two subtasks: Discovery (FCD) Recognition (FCR). While FCD aims at the definition exploration of ‘Formula Concept’ names bundled equivalent representations formula, FCR designed match given assigned unique concept identifier. We present machine learning-based approaches address tasks. then evaluate these on standardized test collection (NTCIR arXiv dataset). approach yields precision 68% retrieving frequent recall 72% extracting name from surrounding text. enable citation within facilitate semantic search question answering, well document similarity assessments plagiarism detection recommender systems.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

the relationship between using language learning strategies, learners’ optimism, educational status, duration of learning and demotivation

with the growth of more humanistic approaches towards teaching foreign languages, more emphasis has been put on learners’ feelings, emotions and individual differences. one of the issues in teaching and learning english as a foreign language is demotivation. the purpose of this study was to investigate the relationship between the components of language learning strategies, optimism, duration o...

15 صفحه اول

an investigation into translation of cultural concepts by beginner and advance student using think – aloud protocols

this research aims at answering the questions about translation problems and strategies applied by translators when translating cultural concepts. in order to address this issue, qualitative and quantitative study were conducted on two groups of subjects at imam reza international university of mashhad. these two groups were assigned as beginner and advanced translation students (10 students). ...

Machine Learning By Using False Discovery Rate

Supervised and unsupervised classification are common topics in machine learning in both scientific and industrial fields, which usually involve three tasks: prediction, exploration, and explanation. False discovery rate (FDR) theory has a close connection to classical classification theory, which must be employed in a sophisticated way to achieve good performance in various contexts. The study...

متن کامل

Machine learning based Visual Evoked Potential (VEP) Signals Recognition

Introduction: Visual evoked potentials contain certain diagnostic information which have proved to be of importance in the visual systems functional integrity. Due to substantial decrease of amplitude in extra macular stimulation in commonly used pattern VEPs, differentiating normal and abnormal signals can prove to be quite an obstacle. Due to developments of use of machine l...

متن کامل

Human Discovery and Machine Learning

Submission to IJCINI This paper studies machine learning paradigms from the point of view of human cognition. Indeed, conceptions in both mahine learning and human learning evolved from a passive to an active conception of learning. Our objective is to provide an interaction protocol suited to both humans and machines, to enable assisting human discoveries by learning machines. We identify the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Scientometrics

سال: 2023

ISSN: ['1588-2861', '0138-9130']

DOI: https://doi.org/10.1007/s11192-023-04667-9